Mapping the structure of research topics through term variant clustering: the TermWatch system
نویسندگان
چکیده
A multi-disciplinary approach integrating computational linguistic techniques is necessary to elaborate indicators of research topic evolution. We describe a system which bases clustering on linguistic relations, instead of the usual co-occurrence paradigm. The interesting features of this approach, embodied in the TermWatch system, lie in the combination of state-of-the-art techniques in computational terminology, mathematics (graph formalism) and visualization techniques. Computational terminology enable us to extract meaningful text chunks and to relate these chunks through linguistic relations. These text chunks are terms and the linguistic relations are syntactic variations. We integrated into this system an adapted visualization tool which enhances comprehension of the research topic layout and their trends. Here we focus on the chronological analysis of graphs issued by TermWatch through a graph visualization tool, Aisee which helps the end-user to track the main tendencies of research topics in his/her field.
منابع مشابه
تحلیل موضوعی مقالات مرتبط با اعتیاد در پایگاه مدلاین به روش خوشه بندی سلسله مراتبی: 2014-1991
Introduction: Addiction, which has recently attracted the attention of researchers, is a serious problem worldwide. The growth of relevant literature contributes to a better understanding of this problem and improves the interaction between executive organizations and academic institutions. It is important to identify the active subject areas within this field and to explore the topics which ar...
متن کاملText mining without document context
We consider a challenging clustering task: the clustering of multi-word terms without document co-occurrence information in order to form coherent groups of topics. For this task, we developed a methodology taking as input multi-word terms and lexico-syntactic relations between them. Our clustering algorithm, named CPCL is implemented in the TermWatch system. We compared CPCL to other existing ...
متن کاملVisualization of association graphs for assisting the interpretation of classifications
Given a query on the PASCAL database maintained by the INIST, we design user interfaces to visualize and browse two types of graphs extracted from abstracts: 1) the graph of all associations between authors (co-author graph), 2) the graph of strong associations between authors and terms automatically extracted from abstracts and grouped using linguistic variations. We adapt for this purpose the...
متن کاملQuery Refinement by Multi Word Term expansions and semantic synonymy
We developed a system, TermWatch (https://stid-bdd.iut.univ-metz.fr/TermWatch/index.pl), which combines a linguistic extraction of terms, their structuring into a terminological network with a clustering algorithm. In this paper we explore its ability in integrating the most promising aspects of the studies on query refinement: choice of meaningful text units to cluster (domain terms), choice o...
متن کاملGeochemical Analysis of Soil Physicochemical Properties and Heavy Metals Content in the Long- term Wastewater-irrigated Soils
Exploring the homogenous regions for site specific management is important, especially in the areas under different anthropogenic activities. This was investigated using multi-way analysis including Factor Analysis, Hierarchical Clustering Analysis and k means in the areas under long-term wastewater irrigation over a period of more than 40 years, in Shahre Rey, south of Tehran. By using Factor ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
دوره شماره
صفحات -
تاریخ انتشار 2004